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I.  INTRODUCTION 


One  of  the  difficulties  in  dealing  with  n-dimensional  feature 
vectors  in  pattern  recognition  is  the  identification  of  the  way  in 
which  the  data  is  distributed  in  the  space.  If  this  distribution  is 
known,  then  the  recognition  problem  may  generally  be  solved 
by  use  of  discriminant  functions  which  take  advantage  of  this 
distribution.^  An  excellent  example  of  the  way  in  which  this  may 
be  done  is  the  OLPARS  system  .  ^  There  have  been  a  number  of 
algorithms  presented  in  the  literature  which  offer  displays  in 
two  or  three  dimensions  of  the  distribution  of  n  dimensional  data. 
A  few  of  these  techniques  are  presented  by  Sammon.  ^  The 
majority  of  such  techniques  appear  *o  be  nonlinear.  Sammon, 
Ball,  3  and  Shepard  and  Carroll”  ^resent  several  examples  of 
nonlinear  mappings. 


The  objective  of  the  display  mappings  is  to  preserve  some 
specified  characteristic  of  the  data  distribution.  One  such 
characteristic  which  is  very  desirable  in  pattern  recognition  is 
linear  separability.  The  objective  for  projection  of  two  classes 
of  data,  then,  is  to  maintain  this  interclass  separability.  An 
approach  which  projects  the  data  set  onto  a  plane  oriented  in  such 
a  way  as  to  maximize  this  interclass  separation  has  been  suggested 
by  Sammon.7  This  technique  uses  the  Fisher  Discriminant®  to 
determine  one  of  the  axes  of  projection.  The  second  axis  is 
determined  by  maximizing  the  data  spread  on  the  projecting  plane. 
This  mapping  is  a  linear  one.  Fukunaga  and  Olsen^  present  a 
similiar  approach  which  is  nonlinear  in  nature. 


A  fundamentally  different  approach  to  this  problem  was 
presented  by  Zahn  in  1971.  In  his  paper,  Z&hn  suggests  that 
graph  theoretic  methods  may  be  used  to  detect  and  describe 
data  clustering.  Basically  this  approach  computes  a  minimal 
spanning  tree  for  the  data  clusters.  A  "stylized"  two  dimensional 
drawing  of  such  a  tree  then  gives  some  information  about  data 
distribution,  primarily  inter-point  distances. 

In  all  of  the  techniques  described  above,  no  cognizance  has 
been  made  of  the  functional  interdependence  of  data.  Many  prob¬ 
lems  have  data  which  are  highly  interdependent.  Examples  of  this 
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are  pattern  recognition  problem#  which  use  multivariate 
feature  vectors  to  describe  two  or  three  dimensional  objects  at 
a  succession  of  aspects  or  view  angles. ^ 

In  cases  such  as  these  highly  structured  data  would  be 
expected*  In  particular,  the  n  dimensional  feature  vectors 
describing  a  two  dimensional  object  would  be  expected  to  form  a 
line  trajectory  in  n  space  as  the  two  dimensional  object  is  rotated. 
Similarly  the  trajectory  of  the  description  for  3  dimensional 
objects  should  form  a  surface  as  the  object  is  rotated. 

Although  each  of  the  projection  techniques  described  ar  ve 
are  capable  of  handling  such  data,  no  effort  has  been  made  to 
exploit  these  characteristics. 

In  what  follows,  it  will  first  be  shown  that  the  minimal  spanning 
trees  of  Zahn  contain  a  great  deal  of  information  about  the  dis¬ 
tribution  of  data  of  this  type.  Further  these  trees  will  be  shown  to 
be  indicative  of  the  quantization  level  necessary  for  application  of 
nearest  neighbor  recognition  schemes.  Finally,  a  linear  projection 
scheme  similar  to  that  of  Reference  7  will  be  used  to  display  the 
data  and  confirm  distributions  suggested  by  the  minimal  spanning 
tries. 

II.  BASIC  DEFINITIONS 

Before  proceeding  to  the  algorithms,  some  fundamental 
definitions  are  required.  For  a  more  complete  discussion  of  graph 
theory  see  Reference  13.  A  linear  graph  is  composed  of  a  set,  V, 
of  vertices  or  nodes  and  a  set,  E,  of  vertex  pairs,  called  edges, 
which  describe  the  intersection  of  vertices.  Whenever  a  number, 
called  an  edge  weight,  is  associated  with  each  edge  the  resulting 
linear  graph  is  called  an  edge  weighted  linear  graph.  A  path 
in  a  graph  is  a  sequence  of  edges  joining  two  nodes.  Typically 
a  path  is  described  by  the  sequence  of  vertices  which  define  the 
edges  of  the  path.  Figure  1  shows  an  edge  weighted  linear  graph 
with  a  path  (ABCFD). 
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Fig.  1.  Edge  weighted  linear  graphs  and  some  trees. 


A  circuit  is  a  closed  path  such  as  (DBFD)  in  Fig.  1.  If  a 
graph  has  a  path  between  every  pair  of  nodes,  then  it  is  said  to 
be  connected.  A  strongly  connected  graph  is  a  connected  graph 
in  which  any  pair  of  nodes  is  connected  by  an  edge.  A  tree  is  a 
connected  graph  having  no  circuits  and  a  spanning  tree  of  a  graph 
G  is  a  tree  in  G  which  contains  all  nodes  of  the  graph  G.  An 
example  of  such  a  treo  for  the  graph  of  Fig.  1(a)  is  shown  in 
Fig.  1(b).  Another  spanning  tree  is  shown  in  Fig.  1  (c).  In 
general,  the  total  number  of  trees  for  a  given  graph  is  quite  large. 
For  edge  weighted  linear  graphs  a  useful  subclass  of  all  the 
spanning  trees  is  the  class  of  trees  called  a  minimal  spanning 
trees  defined  as  follows .  Let  the  weight  of  a  tree  be  defined  as  the 
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sum  of  the  weights  of  its  constituent  edges*  Then  a  minimal 
spanning  tree  is  a  spanning  tree  having  the  lowest  weight. 

Figure  1(b)  shows  a  minimal  spanning  tree  or  MST.  lor  the  graph, 
of  Fig.  1(a).  It  should  be  noted  that,  in  general,  a  given  graph 
may  not  have  a  unique  minimal  spanning  tree. 

A  number  of  algorithms  exist  for  computing  the  minimal 
spanning  tree  of  a  graph.  Two  of  these  procedures,  given  in 
1956  and  195  7  by  Krushal*^  and  Prim*'’  respectively,  are  remark¬ 
ably  straight  forward.  Krushal'u  algorithm  begins  be  listing 
the  edges  of  the  graph  G  in  order  of  ascending  weight.  For  example 


this  list  becomes 

for  Fig.  1(a). 

EDGE 

WEIGHT 

SELECTION 

AB 

1 

X 

BC 

2 

X 

BE 

2 

X 

DF 

2 

X 

BG 

3 

X 

FG 

4 

X 

BF 

5 

CE 

6 

BD 

8 

Starting  with  the  topmost  edge  (least  weight)  select  edges  in  order 
in  such  a  way  that  no  circuits  are  formed  with  edges  already 
selected.  The  process  stops  when  n-1  edges  have  been  chosen, 
where  n  is  the  number  of  nodes  in  the  graph.  Following  this 
selection  process,  the  spanning  tree  of  Fig.  1(b)  is  shown  to  be 
an  MST  for  the  graph  of  Fig.  1(a). 

The  algorithm  of  Prim  is  equally  as  simple.  Start  with  any 
node  and  add  the  edge  connected  to  this  node  having  the  least 
weight.  This  edge  taken  with  its  end  points  forms  a  fragment 
tree,  T^.  T^  is  obtained  from  Tj^_^  by  adding  the  shortest  edge 
connecting  a  node  in  G  but  not  in  T^i*  The  procedure  terminates 
on  tree  Tn.^.  Consider  as  an  example  Fig.  1(a).  Applying  this 
process  to  this  graph  the  following  table  summarizes  the  steps. 
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NODES 

EDGE  ADDED 

WEIGHT 

A 

AB 

1 

A.  B 

BC 

2 

A.  B,  C 

BE 

2 

A.  B»  C»  E 

BG 

3 

A,  B,  C,  E,  G 

GF 

4 

A,  B,  C,  E,  G,  F 

DF 

L 

A,  Q,  C,  E,  G,  F,  D 

Stop 

Zahn  shows  in 

his  paper^  a  number  of  ways 

in  which  the 

minimal  spanning  tree  of  a  graph  may  be  used  to  describe  and 

identify  data  clustering.  For  data  which  is  functionally  inter¬ 
dependent.  data  clustering,  of  the  type  suggested  by  Zahn,  will 
occur  only  when  data  samples  are  spaced  far  apart.  In  the  next 
section  this  problem  will  be  defined  more  precisely  and  it  will 
be  shown  that  such  an  observation  may  be  used  to  identify  data 
trajectory  and  help  set  a  quantisation  level  on  the  data  used  to 
represent  an  object  class  for  pattern  recognition  purposes. 

III.  DATA  DISTRIBUTION  INFERENCE  FROM  MST's 

Let  S  =  o-j,  ctt,,  •  •  *,  <rk  be  a  set  of  k  object  classes  which 
are  to  be  recognized.  Associated  vith  each  object  class  is  a  set 
of  n-tuples,  A<^.  called  features,  which  are  taken  to  represent 
object  class  c^,  at  some  given  set  of  aspects  or  view  angles.  For 
the  discussion  to  follov,,  the  aspect  angles  will  be  defined  as  the 
normal  spherical  coordinate  angles  4>  and  8  shown  in  Fig.  2. 
Thus,  the  set  of  points  in  Ac r. 


Fig.  2.  Aspect  angle  convention. 
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represent  points  on  an  n-dimensional  surface  which  is  formed 
by  the  locus  of  the  feature  vectors  as  the  aspect  angles  change. 
(This  assumes  noiseless  feature  measurements.)  The  set  of 
points  Ar  may  now  be  taken  as  the  vertices  of  a  strongly 
connected  graph,  i.e.,  all  points  in  A<r.  are  connected  to  all 
other  points  in  Ac^  by  edges.  Further,  the  Euclidean  distance 
between  any  two  n-cuples  or  features  in  A^,  35*  and  y,  may  be 
taken  as  the  weight  for  the  corresponding,  connecting  edge.  Thus 
A^  represents  an  edge  weighted,  strongly  connected  linear  graph. 

Next  let  be  the  ordered  set  formed  from  A^  by  an 
ordering  induced  on  the  vectors  A^  by  aspect  angles  as  follows 


*  ^  (i)  (0  ^  0)  0)  o.O)  -s  U> 

0)  Xo,  o*  X0,1.  '**»  Xq,  Q,  ***»  Xj^  ***»  ^j,t>  **‘> 


where 

(  i) 

(2)  Xj(  ^  =  f^  (  4>j,  )  and  where 


and 

(3) 

(4)  ef  >  e|.l  . 


Define  an  ordered  4>;  or  subset  of  A<,.,  denoted  by  Sj,  (<r.)  or 

Se|  (Ti),  1  J 


as 


(5) 


S<j>j  (®i>  a  < 


-a  (i)_s  (i) 
Xj.o*  Xj,l» 


0) 

Xj,«r  > 
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(6)  SU(  (o.) 

(i)  -»  (i) 

3  <  xo,  «*  Xl.  <*  ***• 

where 

t?)  \ ‘q  ' 

Ar,  ,  q  ■  0, 1,  2,  •  •  •,  a 

-i  (i) 

w  xP,i  < 

Ap  ,  P  *  0,1,2, - .  p 

Thus 

(9)  A,  *  < 

r 


8  # 


> 


> 


where  £,  is  the  number  of  different  tj>  aspect  angles  in  the  set 
A^. .  Note  that  if  an  object,  has  cylindrical  symmetry  with 
the  axis  of  symmetry  co-linear  with  z-axrs  then 


(1°)  (a-)  = 


®4v  *or  M-  and  v‘ 


and 


(11)  K)  >  . 

In  general  the  ( cr^ )  may  be  considered  as  sets  of  points  in 
n-space  located  on  tne  line  formed  by  the  locus  of  point6  having 
4>  fixed  and  0  varying  over  all  possible  values.  Consider  now  the 
MST  formed  from  (<r^)  as  described  earlier.  For  simplicity  let 


? 


s4>  <V 


Now  if  the  features,  Xj  ,  are  selected  with  0  increments 
sufficiently  close  to  each  other,  then  the  minimal  spanning  tree 
will  be  define_d  by  connecting  X ^  to  X2^  and  X^^^  to  X^1) 
and  so  on  to  X^_\)  to  ,  On  the  other  hand,  if  the  9  incre¬ 

ments  are  fairly  large,  it  is  easily  seen  that  this  minimal 
spanning  tree  will,  in  general,  not  occur.  As  an  example  consider 
Fig.  3(a).  Suppose  that  ten  points,  numbered  1  through  10  are 
chosen  to  represent  the  curve  shown  and  thus  define  (<r  ). 

The  minimal  scanning  tree  for  this  function  is  shown  in  Fig.  3(b). 
Note  that  because  of  the  large  increments  in  0,  the  points  are 
not  connected  consecutively  in  ♦he  MST.  If,  on  the  other  hand, 
the  increments  were  chosen  smaller  as  in  Fig.  3(c)  then  the 
resulting  MST  will  have  the  points  sequentially  spaced  as  shown 
in  Fig.  3(d). 


Quantization  effects  on  the  MST  of  a  data  set 
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As  indicated  by  this  example,  the  MST  may  be  used 
to  indicate  the  degree  of  quantization  in  aspect  angle  required 
to  properly  represent  the  object  class.  This  is,  of  course, 
purely  a  qualitative  measure  of  accuracy  of  representation 
and  is  intended  to  be  nothing  more.  There  is,  however,  more 
information  in  the  MST  than  simply  quantization  levels.  The 
MST  carries  a  good  deal  of  information  about  structure  as  well. 
As  an  example  of  this  the  multifrequency  backscatter  return 
amplitudes  for  the  step  cylinder  described  in  Appendix  A  were 
obtained.  Seven  data  points  in  8-space  were  taken  corresponding 
to  9  of  0°,  30°,  60°,  •••,  180°.  These  points  are  labeled 
1#  ***,  7  respectively.  A  stylized  version  of  the  minimal 

spanning  tree  for  this  data  is  shown  in  Fig.  4(a).  Note  that  the 
data  points  are  not  consecutively  arranged  in  the  MST,  thus 
indicating  a  course  quantization  level.  However,  this  courseness 
makes  possible  the  rough  ident;fication  of  data  trajectory,  since 
the  data  points  should  lie  consecutively  on  the  locus  ol  points 
formed  by  varying  9  through  all  possible  values.  Thus  it  may  be 


Fig.  4.  MST  for  step  cylinder  and  inferred  trajectory. 

inferred  from  the  MST  that  because  points  1  and  7  are  adjacent 
the  trajectory  must  be  folded  back.  This  is,  of  course,  what  one 
expects  since  the  step  cylinder  has  the  same  cross  section  from 
look  angles  6  s  0°  or  6  *  180°.  In  a  similar  way  note  that  points 
3  and  5,  corresponding  to  9  *  60  and  9»  129°  respectively  are 
adjacent  as  would  be  expected  from  the  symmetry  of  the  target. 
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It  may  then  be  seen  that  the  MST  does  indicate  in  some  sense 
the  data  trajectory.  More  to  the  point,  however,  the  minimal 
spanning  tree  allows  one  to  identify  those  aspect  angles  at  which 
the  target  object  shows  a  similar  views  or  cross  sections.  With 
proper  quantization,  this  observation  may  be  carried  to  multiple 
objects.  For  example,  the  data  for  the  cylinder  and  prolate 
spheroid  described  in  Appendix  A  were  combined  to  form  one 
data  set  from  which  the  minimal  spanning  tree  was  computed. 

The  data  points  were  labeled  1  to  10  for  the  cylinder  data  and 
11  to  20  for  the  prolate  spheroid.  The  corresponding  aspect  angles 
were  0»  0°,  10°,  20°,  •••,  90°  in  both  cases.  Figure  5  shows 
the  corresponding  MST  and  the  inferred  distribution.  These 
figures  suggest  that  at  an  aspect  angle  of  something  like  40°  to  50° 


Fig.  5.  MST  and  inferred  trajectory  for  combined  prolate 
spheroid  and  cylinder  data. 
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the  two  objects  appear  more  alike  than  at  other  aspects.  It 
does  not  imply  that  the  targets  look  the  same;  only  that  they 
look  more  alike  at  this  aspect  than  others.  It  will  be  shown 
below  that  the  inferred  trajectories  are  probably  correct. 

Before  discuosing  the  linear  transformation  of  the  data,  a 
few  comments  or.  application  of  the  minimal  spanning  tree  are 
in  order.  One  of  the  very  useful  pattern  recognition  techniques 
is  application  of  a  nearest  neighbor  rule  for  identification.  One 
such  _rule  identifies  an  unknown  vector,  as  belonging  to  class 
cr-  if  X  is  nearer  some  vector  in  A<r,  a,  than  any  vector  in  any- 
other  Ao^ .  Two  pieces  of  information  are  thus  obtained.  The 
first  is  object  identification.  The  second  is  object  orientation 
with  respect  to  the  viewer.  If  the  aspect  angle  quantization  is 
small  enough  then  the  target  orientation  is  easily  and  accurately 
obtained  by  assigning  it  as  the  orientation  associated  with  vector 
a*.  Dudani^  effectively  does  this  in  his  recognition  schemes 
using  moment  invariants.  If  on  the  other  hand,  this  quantization 
is  fairly  course  then  the  accuracy  of  the  orientation  estimate 
is  questionable.  For  example  assume  the  point  to  be  identified 
is  the  one  marked  X  in  Fig.  4(b).  This  point  is  closer  to 
point  2  than  5  and  this  would  be  assigned  an  orientation  corres¬ 
ponding  to  point  2.  In  fact,  X  is  closer  to  a  point  on  the  line 
connecting  point  5  and  6  than  it  is  to  either  point  2  or  5 .  If  the 
quantization  had  been  smaller  the  error  in  orientation  could  thus 
have  been  avoided.  To  set  this  quantization  one  needs  only  to 
repeatedly  decrease  the  0  increment  until  the  minimal  spanning 
tree  has  contiguous  points  in  contiguous  locations  in  the  tree. 

It  should  be  noted  that  this  quantization  may  be  set  locally  rather 
than  the  globally,  i.e.,  the  minimum  0  increments  required  may 
be  different  for  different  regions  in  the  data  trajectory. 

Such  a  quantization  level  is  too  fine,  however,  for  investiga¬ 
tion  of  data  trajectory.  The  reason  for  this  is  that  the  minimal 
spanning  tree  is  simply  a  listing  of  interconnected  nodes  and  is 
not  an  actual  visual  display.  Thus  a  fine  quantization  s  unply  says 
that  point  1  is  connected  to  point  2  and  2  is  connected  to  3  and  so 
on  which  was  known  to  begin  with.  By  making  a  fairly  course 
quantization,  the  points  in  the  MST  may  no  longer  be  located 
consecutively  and  thus  give  information  concerning  the  data 
trajectory  by  inference  form  these  noncontiguous  points. 
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IV.  A  TWO  DIMENSIONAL  LINEAR  PROJECTION 


The  use  of  the  minimal  spanning  tree  to  identify  data 
trajectory  is  qualitative  by  nature  and  allow*  for  a  good  deal 
of  subjective  analysis.  What  is  needed  then  is  some  approach 
which  may  be  used  to  supply  some  form  of  quantitative  infor¬ 
mation  about  data  distribution.  This  approach  should  not 
introduce  appreciable  distortion  in  the  display  and  therefore 
probably  should  be  linear.  For  the  case  of  two^data  classes 
a  very  nice  approach  was  proposed  by  Sairmon  using  the  Fisher 
discriminant8.  This  approach  is  not  readily  adaptable  to  a  one 
class  problem.  In  order  to  handle  such  a  problem  a  new  approach 
is  needed. 

Basically  the  main  requirement  on  the  projection  plane  is 
that  the  projected  data  be  spread  out  on  the  plane  as  much  as 
possible,  i.e<»  the  standard  deviation  should  be  as  large  as 
possible.  One  procedure  by  which  this  ideal  situation  may 
partially  be  accomplished  is  described  below.  The  choice  of 
the  algorithm  has  been  devised  to  meet  two  requirements.  First, 
it  should  give  a  fairly  large  projected  data  spread.  This  spread 
need  not  be  maximal  in  any  sense.  Secondly,  and  most  impor¬ 
tantly,  the  projection  plane  axis  should  be  easily  determined. 

Thus  the  following  algorithm  is  heuristicly  derived. 

One  way  in  which  a  projection  hyperplane  may  be  chosen  so 
as  to  produce  a  large  spread  in  the  data  is,  at  least  intuitively, 
to  project  the  n  dimensional  data  points  into  some  two  dimensional 
subspace  of  an  n-1  dimensional  hyperplane  oriented  along  the 
direction  of  data  spread.  The  hyperplane  through  the  data  spread 
may  be  thought  of  in  the  same  way  as  one  thinks  of  the  plane  of 
the  Milky  Way.  More  specifically,  this  plane  iB  defined  by  the 
vector  such  that  Sj  is  minimized  where 

(13)  zx  -  2s  [  -  p)  *  -(i)  J2 

A  i  «1 
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and  where* 


(H)  6  =  i  Ag.  | 

(15)  p  =  —  2*  (i) 

for  all  a  <  Aa  . 


The  minimization  problem  {13)  will  be  trivally  solved  by  the 
zero  vector  so  some  constraint  needs  to  be  added  to  insure  a 
nonzero  solution.  In  the  vast  majority  of  cases,  the  spread 
hyperplane  will  not  contain  the  first  moment  vector  p,  i.e., 
most  of  the  time  "p  *  u  *  0.  Thus  a  constraint  of  pT  •  w '  '  *  1 
may  bs  added  to  (13)  using  a  Langrange  multiplier  to  produce  the 
following  optimization  problem. 

s  _(i)  M)  2  U)  . 

(16)  Min  z^  =  ,21  [  (o  -  p  )  •  u>  ]  +  (1  -  w  •  p  ) 


By  setting  the  partial  derivatives  of  *\  with  respent  to  and 
uy  i  =  1,2,  *,  n  to  zero  and  solving  the  resulting  system  of 

equations,  w  may  be  obtained. 


The  next  problem  is  to  determine  an  appropriate  two 
dimensional  subspace  of  the  hyperplane  defined  by  or  .  One  way 

chosen  here  arbitarily)  is  to  define 


of  doing  this  (and  the  way  ehc 

an  n-1  orthonormal  basis  w  ['*} 

-(1)  -j(2)  ...  rfCn) 


u 


(n) 


in  w  U)  such  that  the 
vectors  w  **',  C'"'  are  mutually  perpendicular.  This 

may  be  done  iteratively  as  follows. 


*  The  order  of  a  set  X  is  denoted  by 
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(17)  Min  S(£  =  .£*  [  (o^  -  tf)  •£(*))]2  +  <l-£  (k  )  •?) 


K  -1 

+  r 

i-i 


ki 


2.3. 


n 


The  system  of  equations  arising  by  setting  to  zero  tho  appro¬ 
priate  partial  derivative  of  z  become 


(18) 


<P 

0 


H 


(i) 


'ii 


cln  *  Pi  W1 


Pi  *Z 


w, 


nl 


-Pi 


w 


U) 

I 


J2) 

W1 


cnn  -  Pn  un 


r  n 
(1) 
wn 

(2) 

u  ' 


’w. 


U  -if 

"  Ik  )’ 

W1 

u-i) 

(k  ) 
w2 

• 

a  -1) 

• 

• 

w(*  > 
n 

U-D  U-l) 
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A  solution  to  (IB)  thus  gives  the  x  ^  vector  w  . 

Tht  two  dimensional  projection  plane  is  finally  taken  as  defined 
by  and  w  (h’1).  The  projection  is  then  made  as 

(19)  *  a  •  u(n) 

(20)  Yj  =  a(i)  •  w  (n_1) 

for  i  *  1,  2,  •  *  * ,  S.  i .  e . ,  for  all  t  A^. . 


Figures  6,  7,  and  8  show  the  data  for  the  step  cylinders 
prolate  spheroid  and  the  cylinder  respectively  as  projected  onto 
planes  derived  as  just  described.  Note  that  the  data  trajectory 
is  clearly  shown  and  further  that  the  trajectory  of  the  step 
cylinder  data  is.  indeed,  similar  to  the  inferred  trajectory  as 
shown  in  Fig.  4(b). 

If  two  classes  exist  then  the  above  procedure  may  be  used 
for  display  after  suitable  modification.  In  this  case,  normally, 
the  important  criteria  is  maintenance  of  linear  separation  in 
the  projected  image.  Maximizing  the  data  spread  thus  becomes 
secondary.  To  accomplish  these  goals  1 3^  may  be  taken  as  a 
separating  hyperplane  for  the  two  classes  or  some  "near" 
separating  hyperplane  if  the  classes  are  not  linearly  separable. 
Applying  the  procedure  described  above,  the  projection  axes 
then  are  taken  as  w  M  and  w in  equations  (19)  and  (20) 
respectively.  The  hyperplane  w W  may  be  found  in  a  number  of 
ways  such  as  the  Fisher  discriminant  mentioned  earlier,  linear 
programming^,  Qr  an  adoptive  approach  such  as  described  in 
Reference  11.  Using  the  latter,  the  above  projection  technique 
was  applied  to  the  combined  prolate  spheroid/cylinder  data. 

The  resulting  projection  is  shown  in  Fig.  9.  Note,  first,  thnt 
the  separation  of  the  two  classes  is  clearly  shown  and,  secondly, 
the  similarity  to  the  inferred  trajectories  of  Fig.  5(b).  As  a 
further  reference,  compare  Fig,  3  with  the  individual  trajectories 
for  the  cylinder  and  prolate  Bpheroid  shown  respectively  in 
Fig  •  7  and  8  . 
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Fig.  6.  Step  cylinder  data  projection. 


I 

i 

i 

1 


V.  CONCLUDING  REMARKS 


This  report  has  dealt  with  two  methods  o£  multivariate  data 
analysis  which  have  been  effectively  used  to  represent  functionally 
related  data  structures.  Taken  together  they  offer  the  user  a 
method  of  visualizing  data  structure  and  thereby  providing  some 
insight  into  the  problem  incountered  in  attempting  to  classify  or 
recognize  an  unknown  target  object. 

In  particular  there  arc  two  areas  related  to  pattern  recogni¬ 
tion  where  a  thorough  knowledge  of  data  trajectory  becomes  of 
great  value.  The  first  has  to  do  with  the  actual  identification  of  the 
target  object  from  a  knowledge  of  the  target  data  trajectory.  By 
comparing  this  unknown  trajectory  with  a  set  of  known  ones  an 
identification  may  be  possible  on  the  basis  of  similar  trajectory 
shapes.  This  becomes  especially  important  in  cases  where 
there  is  a  significant  overlap  in  data  between  two  or  more  object 
classes.  Much  work  is  yet  to  be  done  in  this  area. 

The  second  application  area  hns  to  do  with  the  identification  of 
target  aspect  from  a  knowledge  cf  target  identification  and  target 
data  trajectory.  By  matching  up  the  measured  trajectory  with 
some  portion  of  the  known  data  trajectory  to  obtain  a  best  fit  the 
temporal  orientation  of  the  target  may  be  inferred,  i.e.,  one  can 
identify  the  trajectory  of  the  target  object  relative  to  the  observer. 

From  these  applications,  it  is  clear  that  a  knowledge  of  data 
trajectory  allows  one  to  obtain  greater  confidence  in  a  given 
identification  as  well  as  help  in  choosing  a  particular  recognition 
algorithm  to  be  used  with  a  particular  data  set.  The  data  analysis 
algorithm  described  here  is  intended  to  help  in  such  determinations. 
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